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Abstract 

In the design of algorithms, the greedy paradigm provides a powerful tool for solving effi- 
ciently classical computational problems, within the framework of procedural languages. 
However, expressing these algorithms within the declarative framework of logic-based lan- 
guages has proven a difficult research challenge. In this paper, we extend the framework of 
Datalog-like languages to obtain simple declarative formulations for such problems, and 
propose effective implementation techniques to ensure computational complexities compa- 
rable to those of procedural formulations. These advances are achieved through the use of 
the choice construct, extended with preference annotations to effect the selection of alter- 
native stable-models and nondeterministic fixpoints. We show that, with suitable storage 
structures, the differential fixpoint computation of our programs matches the complexity 
of procedural algorithms in classical search and optimization problems. 



1 Introduction 

The problem of finding efficient implementations for declarative logic-based lan- 
guages represents one of the most arduous and lasting research challenges in com- 
puter science. The interesting theoretical challenges posed by this problem are made 
more urgent by the fact that extrema and other non-monotonic constructs are 
needed to express many real-life applications, ranging from the 'Bill of Materials' 
to graph-computation algorithms. 

Significant progress in this area has been achieved on the semantic front, where 
the introduction of the well-founded model semantics and stable-model semantics 
allows us to assign a formal meaning to most, if not all, programs of practical inter- 
est. Unfortunately, the computational problems remain largely unsolved: various ap- 
proaches have been proposed to more effective computations of well-founded models 
and stable models ( |Van Gelder et ai, 1991|[Gelfond and Lifschitz, 1988| ), but these 
fall far short of matching the efficiency of classical procedural solutions for say, al- 
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gorithms that find shortest paths in graphs. In general, it is known that determining 
whether a program has a stable model is NP-complete ( |Marek and Truszczynski, 1991| ). 

Therefore, in this paper we propose a different approach: while, at the semantic 
level, we strictly adhere to the formal declarative semantics of logic programs with 
negation, we also allow the use of extended non-monotonic constructs with first or- 
der semantics to facilitate the task of programmers and compilers alike. This entails 
simple declarative formulations and nearly optimal executions for large classes of 
problems that are normally solved using greedy algorithms. 

Greedy algorithms ( |Moret and Shapiro, 1993| ) are those that solve a class of opti- 
mization problems, using a control structure of a single loop, where at each iteration 
some element judged the 'best' at that stage is chosen and it is added to the solution. 
The simple loop hints that these problems are amenable to a fixpoint computation. 
The choice at each iteration calls attention to mechanisms by which nondetermin- 
istic choices can be expressed in logic programs. This framework also provides an 
opportunity of making, rather than blind choices, choices based on some heuristic 
criterion, such as greedily choosing the least (or most) among the values at hand 
when seeking the global minimization (or maximization) of the sum of such values. 
Following these hints, this paper introduces primitives for choice and greedy selec- 
tion, and shows that classical greedy algorithms can be expressed using them. The 
paper also shows how to translate each program with such constructs to a program 
which contains only negation as nonmonotonic construct, and which defines the 
semantics of the original program. Finally, several classes of programs with such 
constructs are defined and it is shown that (i) they have stable model semantics 
(ii) they are easily identifiable at compile time, and (iii) they can be optimized for 
efficient execution — i.e., they yield the same complexities as those expected from 
greedy algorithms in procedural programs. Thus, the approach provides a program- 
mer with declarative tools to express greedy algorithms, frees him/her from many 
implementation details, yet guarantees good performance. 

Previous work has shown that many non deterministic decision problem can be 
easily expressed using the nondeterministic construct choice in logic programs ( |Sacca and Zaniolo, 1990 
|Giannotti et al., 1991| . In ( |Giannotti et al., 1999| ), we showed that while the se- 
mantic of choice requires the use of negation under total stable model semantics, a 
stable model for these programs can be computed in polynomial time. In fact, choice 
in Datalog programs stratified with respect to negation achieves DB-Ptime com- 
pleteness under genericity ( |Abiteboul et al., 19 94). In this paper, we further explore 
the ability of choice to express and support efficient computations, by specializing 
choice with optimization heuristics expressed by the choiceleast and choicemost 
predicates. Then, we show that these two new built-in predicates enable us to ex- 
press easily greedy algorithms; furthermore, by using appropriate data structures, 
the least-fixpoint computation of a program with choice-least and choice-most em- 
ulates the classical greedy algorithms, and achieves their asymptoptic complexity. 

A significant amount of excellent previous work has investigated the issue of how 
to express in logic and compute efficiently greedy algorithms, and, more in general, 
classical algorithms that require non-monotonic constructs. An incomplete list in- 
clude work by ( |Sudarshan and Ramakrishnan, 1991||Dietrich, 19 92 Sudarsh an and Ramakrishnan, 1991) 
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|Van Gelder, 1993) |Ganguly et al, 1995| ). This line of research was often motivated 
by the observation that many greedy algorithms can be viewed as optimized ver- 
sions of transitive closures. Efficient computation of transitive closures is central to 
deductive database research, and the need for greedy algorithms is pervasive in de- 
ductive database applications and in more traditional database applications such as 
the Bill of Materials (Zanio lo"et al., 1997| ). In this paper, we introduce a treatment 
for greedy algorithms that is significant simpler and more robust than previous 
approaches (including that proposed in ( |Greco et al, 1992] ) where it was proposed 
to use the choice together with the built-in predicates least and most); it also 
treats all aspects of these algorithms, beginning from their intuitive formulation, 
and ending with their optimized expression and execution. 

The paper is organized as follows. In Section 2 we present basic definitions on 
the syntax and semantics of Datalog. In Section 3, we introduce the notion of 
choice and the stable-model declarative semantics of choice programs. In Section 
4, we show how with this non-deterministic construct we can express in Datalog 
algorithms such as single-source reachability and Hamiltonian path. A fixpoint- 
based operational semantics for choice programs presented in Section 5, and this 
semantics is then specialized with the introduction of the choice-least and choice- 
most construct to force greedy selections among alternative choices. In Section 6, 
we show how the greedy refinement allow us to express greedy algorithms such as 
Prim's and Dijskra's. Finally, in Section 7, we turn to the implementation of choice, 
choice-least and choice-most programs, and show that using well-known deductive 
DB techniques, such as differential fixpoint, and suitable access structures, such as 
hash tables and priority queues, we achieve optimal complexity bounds for classical 
search problems. 

2 Basic Notions 

In this section, we summarize the basic notions of Horn Clauses logic, and its 
extensions to allow negative goals. 

A term is a variable, a constant, or a complex term of the form f(ti, ■ ■ ■ ,t n ), 
where tx,...,t n are terms. An atom is a formula of the language that is of the 
form p(ii, . . . , t n ) where p is a predicate symbol of arity n. A literal is either an 
atom (positive literal) or its negation (negative literal) . A rule is a formula of the 
language of the form 

Q * Qli • • • ) Qm- 

where Q is a atom (head of the rule) and Q\, . . . , Q m are literals (body of the rule). 
A term, atom, literal or rule is ground if it is variable free. A ground rule with 
empty body is a fact. A logic program is a set of rules. A rule without negative 
goals is called positive (a Horn clause); a program is called positive when all its rules 
are positive. A DATALOG program is a positive program not containing complex 
terms. 

Let P be a program. Given two predicate symbols p and q in P, we say that 
p directly depends on q, written p -< q if there exists a rule r in P such that p is 
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the head predicate symbol of r and q occurs in the body of r. The binary graph 
representing this relation is called the dependency graph of P. The maximal strong 
components of this graph will be called recursive cliques. Predicates in the same 
recursive clique are mutually recursive. A rule is recursive if its head predicate 
symbol is mutually recursive with some predicate symbol occurring in the body. 

Given a logic program P, the Herbrand universe of P, denoted Hp, is the set of 
all possible ground terms recursively constructed by taking constants and function 
symbols occurring in P. The Herbrand Base of P, denoted Bp, is the set of all 
possible ground atoms whose predicate symbols occur in P and whose arguments 
are elements from the Herbrand universe. A ground instance of a rule r in P is a 
rule obtained from r by replacing every variable X in r by a ground term in Hp. 
The set of ground instances of r is denoted by ground(r); accordingly ground(P) 
denotes {J r€P ground(r). A (Herbrand) interpretation I of P is any subset of Bp. 
An model M of P is an interpretation that makes each ground instance of each rule 
in P true (where a positive ground atom is true if and only if it belongs to M and a 
negative ground atom is true if and only if it does not belong to M — total models). 
A rule in ground(P) whose body is true w.r.t. an interpretation / will also be called 
fireable in /. Thus, a model for a program can be constructed by a procedure that 
starts from I :— % and adds to / the head of a rule r E ground(P) that is Arable 
in I (this operation will be called firing r) until no Arable rules remain. A model of 
P is minimal if none of its proper subsets is a model. Each positive logic program 
has a unique minimal model which defines its formal declarative semantics. 

Given a program P and an interpretation M for P, we denote as groundM(P) 
the program obtained from ground(P) by 

1. removing every rule having as a goals some literal ->q with q G M 

2. removing all negated goals from the remaining rules. 

Since ground m{P) is a positive program, it has a unique minimal model. A model 
M of P is said to be stable when M is also the minimum model of groundM(P) 
elfond and Lifschitz, 19880 . A given program can have one or more stable (total) 
model, or possibly none. Positive programs, stratified programs ( |Apt et al, 19880 , 

locally stratified programs ( |Przymusmski, 19880 and weakly stratified programs ( |Przymusinska and Przymusinski,~ 
are among those that have exactly one stable model. 

Let I be an interpretation for a program P. The immediate consequence operator 
Tp(I) is defined as the set containing the heads of each rule r € ground(P) s.t. all 
positive goals of r are in I, and none of the negated goals of r, is in I. 



3 Nondeterministic Reasoning 

Say that our university database contains a relation student (Name, Major, Year), 
and a relation prof essor(Name, Major). In fact, let us take a toy example that only 
has the following facts: 

student('JimBlack', ee, senior). prof essor(ohm, ee). 

prof essor(bell, ee). 
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Now, the rule is that the major of a student must match his/her advisor's major 
area of specialization. Then eligible advisors can be computed as follows: 

elig_adv(S, P) <— student (S, Majr, Year), prof essor(P, Majr). 
This yields 

elig_adv('JimBlack', ohm). 
elig_adv('jimBlack', bell). 

But, since a student can only have one advisor, the goal choice((S), (P)) must 
be added to force the selection of a unique advisor, out of the eligible advisors, for 
a student. 

Example 3.1 

Computation of unique advisors by choice rules 

actual_adv(S, P) <— student(S, Majr, Yr), prof essor(P, Majr), 
choice((S), (P)). 

The computation of this rule gives for each student S a unique professor P 

The goal choice((S), (P)) can also be viewed as enforcing a functional dependency 
(FD) S — > P on the results produced by the rule; thus, in actual_adv, the second 
column (professor name) is functionally dependent on the first one (student name). 

The result of executing this rule is nondeterministic. It can either give a sin- 
gleton relation containing the tuple (' JimBlack', ohm) or that containing the tuple 
('JimBlack',bell). 

A program where the rules contain choice goals is called a choice program. The 
semantics of a choice program P can be defined by transforming P into a program 
with negation, foe(P), called the first order equivalent of a choice program P. 
foe(P) exhibits a multiplicity of stable models, each obeying the FDs defined by 
the choice goals. Each stable model for foe(P) corresponds to an alternative set of 
answers for P and is called a choice model for P. foe(P) is defined as follows: 

Definition 3.1 

( |Sacca and Zaniolo, 1990| The first order equivalent version foe(P) of a choice 
program P is obtained by the following transformation. Consider a choice rule r in 
P: 

r:A^B(Z), choice^), (Y x )), choice{{X k ),(Y k )). 

where 

(i) B(Z) denotes the conjunction of all the goals of r that are not choice goals, 
and 

(ii) Xi, Yi, Z, 1 < i < k, denote vectors of variables occurring in the body of r 
such that Xi n Yi = and X i7 Y t C Z. 



Then, foe(P) is constructed by transforming the original program P as follows: 
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1. Replace r with a rule r' obtained by substituting the choice goals with the 
atom chosen r (W): 

r' :A+- B(Z), chosen r (W). 
where W C Z is the list of all variables appearing in choice goals, i.e., W = 
Ui<j<k X 3 uy r 

2. Add the new rule 

chosen r (W) <— B(Z), -^diffchoice r (W). 

3. For each choice atom choice((Xi), (Y*)) (1 < i < k), add the new rule 

diffchoice r (W) <- chosen r (W'), Yj ^ Y/. 

where (i) the list of variables W is derived from W by replacing each A £ Xi 
with a new variable A' (i.e., by priming those variables), and (ii) Y ^ Y/ is 
true if A ^ A', for some variable A £ Y and its primed counterpart A' £ Y/. 

The first order equivalent version of Example 13.11 is given in Example 13.21 which 
can be read as a statement that a professor will be assigned to a student whenever 
a different professor has not been assigned to the same student. 

Example 3.2 

The first order equivalent version of the rule in Example 13. II 

actual_adv(S, P) <— student(S, Majr, Yr), prof essor(P, Majr), 
chosen(S, P). 

chosen(S,P)^ student(S, Majr, Yr), prof essor(P, Majr), 

-idiff choice(S,P). 
diff choice(S,P) <- chosen(S, P'), P ^ P'. 

In general, the program foe(P) generated by the transformation discussed above 
has the following properties ( Giannotti et al, 1991): 

• foe(P) has one or more total stable models. 

• The chosen atoms in each stable model of foe(P) obey the FDs defined by 
the choice goals. 

The stable models of foe(P) are called choice models for P. 

While the topic of operational semantics for choice Datalog programs will be 
further discussed in Sectional it is clear that choice programs can be implemented 
efficiently. Basically, the chosen atoms must be produced one-at-a-time and memo- 
rized in a table. The diff choice atoms need not be computed and stored [diff choice 
rules are not range restricted and their evaluation could produce huge results); 
rather, a goal -\ diff choice (t) can simply be checked dynamically against the table 
chosen. Since these are simple operations (actually quasi constant-time if an hash 
table is used), it follows that choice Datalog programs can be computed in poly- 
nomial time, and that rules with choice can be evaluated as efficiently as those 
without choice. 
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4 Computing with Choice 

Choice significantly extends the power of Datalog, and Datalog with stratified nega- 
tion ( |Greco et al., 1995) |Giannotti et al., 1991| ) . In this paper we consider Datalog 
with the nondeterministc construct choice, although our framework can be easily 
extended to also consider stratified negation. 

The following example presents a choice program that pairwise chains the ele- 
ments of a relation d(X), thus establishing a random total order on these elements. 

Example J^.l 

Linear sequencing of the elements of a set. The elements of the set are stored by 
means of facts of the form d(Y). 

succ(root, root). 

succ(X,Y)^- succ(_,X), d(Y), 

choice((X), (Y)), choice((Y), (X)). 

Here succ(root, root) is the root of a chain linking all the elements of d(Y). The 
transitive closure of succ thus defines a total order on the elements of d. Because of 
the ability of choice programs to order the elements of a set, Datalog with choice is 
P-time complete and can, for instance, express the parity query — i.e., determining 
if a relation has an even number of elements ( |Abiteboul et al., 1994| ). This query 
cannot be expressed in Datalog with stratified negation unless we assume that the 
underlying universe is totally ordered — an assumption that violates the data inde- 
pendence principle of genericity ( |Chandra and Harel, 1982||Abiteboul et al., 1994| ). 

The expressive power of the choice construct has been studied in ( |Giannotti et al., 1999| 
|Greco et ai, 1995| ) , where it is shown that it is more powerful than other nondeter- 
ministic constructs, including the witness operator (|Abiteboul and Vianu, 1991), 



and the original version of choice proposed in ( |Krishnamurthy and Naqvi, 1988 



which is called static-choice, to distinguish it from the dynamic choice used here 
( |Giannotti et al., 1991| ). For instance, it has been shown in {Giannotti et al., 1991| ), 
that the task of ordering a domain or computing whether a relation contains an 
even number of elements (parity query) cannot be performed by positive programs 
with static choice or the witness operator ( |Abiteboul and Vianu, 1991| ). 

In the rest of the paper, we will study nondeterministic queries combined with 
optimization criteria. For instance, our previous advisor example can be modified 
using optimized criteria to match students with candidate advisors. In the next 
example we present the general matching problem for bipartite graphs. 

Example 

Matching in a bipartite graph. We are given a bipartite graph G = ((Vi, Va), E), 
i.e. a graph where nodes are partitioned into two subset V\ and Vi and each edge 
connect nodes in V\ with nodes in Va- The problem consists to find a matching, i.e., 
a subset E' of E such that each node in V\ is joined with at most one edge in E' 
with a node in V% and vice versa. 

matching(X, Y) <- g(X, Y, C), choice((Y), (X)). 

choice((X), (Y)), choice((X), (C)). 
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Here a fact g(x,y,c) denotes that there is an edge with cost c joining the node 
x € Vi with the node y e V-*. 

In section|Bl we will consider the related optimization problem, of finding a match- 
ing such that the sum of all Cs is minimized or maximized 1 . 

Example 4-3 

Rooted spanning tree. We are given an undirected graph where an edge joining two 
nodes, say x and y, is represented by means of two facts g{x,y,c) and g{y,x,c), 
where c is the cost. A spanning tree in the graph, starting from the source node a, 
can be expressed by means of the following program: 

st(root, a, 0). 

st(X, Y, C) «- st(_, X, .), g(X, Y, C), Y ^ a, Y ^ X, 
choice((Y), (X)), choice((Y), (C)). 

To illustrate the presence of multiple total choice models for this program, take a 
simple graph G consisting of the following arcs: 

g(a,b,l). g(b,a,l). 
g(b,c,2). g(c,b,2). 
g(a, c,3). g(c,a,3). 

After the exit rule adds st(root, a, 0), the recursive rule could add st(a, b, 1) 
and st(a, c,3) along with the two tuples chosen(a, b, 1) and chosen(a, c, 3) in 
the chosen table. No further arc can be added after those, since the addition of 
st(b, c,2) or st(c,b, 2) would violate the FD that follows from choice((Y), (X)) 
enforced through the chosen table. However, since st(root, a, 0), was produced by 
the first rule (the exit rule), rather than the second rule (the recursive choice rule), 
the table chosen contains no tuple with second argument equal to the source node 
a. Therefore, to avoid the addition of st(c, a, 3) or st(b, a, 1), the goal Y/a was 
added to the recursive rule. 

By examining all possible solutions, we conclude that this program has three dif- 
ferent choice models: Mi = {st(a,b, 1), st(b, c, 2)}UG, M 2 = {st(a, b, 1), st(a, c, 3)}U 
G and M 3 = {st(a, c, 3), st(c,b, 2)} U G. 

Example 4-4 

Single-Source Reachability. Given a direct graph where the arcs are stored by means 
of tuples of the form g(x,y,c), the set of nodes reachable from a node a can be 
defined by the following program: 

reach(a, 0). 

reach(Y,C)^- reach(X,Cj), g(X,Y,C 2 ), Y ^ a, 
C = C! + C 2 , choice((Y), (C)). 

1 Given that the pair X — > Y, X — > C is equivalent to X — > Y, C, the last rule in the previous example 
can also be written as follows: 

matching(X, Y) <— g(X, Y, C), choice((Y), (X)), choice((X), (Y, C)). 
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Once the cost arguments are eliminated from these rules, we obtain the usual 
transitive-closure-like program, for which the fixpoint computation terminates once 
all nodes reachable from node a are found, even if the graph contains cycles. How- 
ever, if the choice goal were eliminated, the program of Example 14 . 41 could become 
nonterminating on a cyclic graph. 

In the next example, we have a complete undirected labeled graph G, represented 
by facts g(x,y,c), where the label c typically represents the cost of the edge. A 
simple path is a path passing through a node at most once. A Hamiltonian path 
is a simple path reaching each node in the graph. Then, a simple path can be 
constructed as follows: 

Example 4-5 

The simple path problem. When the arc from X to Y is selected, we must make sure 
that the ending node Y had not been selected and the starting node X is connected to 
some selected node. The choice constraints, and the goals spath(root, Z, 0), Y / Z 
to avoid returning to the initial node, ensure that a simple path is obtained. 

spath(root,X, 0) <— g(X, _, _), choice((), X)). 

spath(X, Y,C) <- spath(_,X, _), g(X,Y,C), spath(root, Z, 0), Y ^ Z, 

choice((X), (Y)), choice((Y), (X)), choice((Y), (C)). 

When G is a complete graph, the simple path produced by this program is 
Hamiltonian (i.e., touches all the nodes). In many applications, we need to find 
a minimum-cost Hamiltonian path; this is the Traveling Salesman Problem (TSP) 
discussed in Section EJ 

The next program presents a problem consisting in the selection of a set of el- 
ements satisfying a constraint. The optimized version of this problem is the well- 
known knapsack problem. 

5 Fixpoint Semantics 

5.1 Choice programs 

Let / be an interpretation for a program P; the immediate consequence operator 
Tp(I) is defined as the set containing the heads of each rule r £ ground(P) s.t. 
all positive goals of r are in /, and none of the negated goals of r, is in I. For 
a choice program P, with first order equivalent /oe(P), let us denote by Tp c the 
immediate consequence operator associated with the rules defining the predicate 
chosen in /oe(P) (these are the rules with the -idiff choice goals) and let Tp D 
denote the immediate consequence for all the other rules in foe(P) (for positive 
choice programs these are Horn clauses). 

Therefore, we have that, for any interpretation / of foe(P): 

T foe{P) (I)=T PD (I) U T Pc (I). 

Following ( |Giannotti et al., 1999| ) we can now introduce a general operator for 
computing the nondeterministic fixpoints of a choice program P. We will denote 
by FDp the functional dependencies defined by the choice goals in P. 
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Definition 5.1 

Given a choice program P, its nondeterministic immediate consequence operator 
^ p is a mapping from an interpretation of foe(P) to a set of interpretations of 
foe(P) defined as follows: 

& P (I) = { T p ^ D (I U AC) U AC | ACeI>(/)} (1) 

where: T P (I) = {0} if T Pc (I) = 0, and otherwise: 

T P (I) = {AC | C AC C T Pc (I) \ I and / U AC |= FD P } (2) 

with / U AC |= FDp denoting that / U AC satisfies the dependencies in FDp. 

Therefore, the ^ p operator is basically the composition of two operators. Given an 
interpretation /, the first operator computes all the admissible subsets of AC C 
Tp c (I), i.e., those where / U AC obeys the given FDs; the second operator derives 
the logical consequence for each admissible subset using the cj-power of Tp D . 

The definition of Tp(I) is such that AC is not empty iff Tp c (I) \ I is not empty; 
thus, if there are possible new choices, then at least one has to be taken. The 
operator formalizes a single step of a bottom-up computation of a choice program. 
Instead of defining the powers of ^p, it is technically more convenient to define 
directly the notion of a nondeterministic computation based on the \I/p operator. 

Observe that given the presence of the constraint, / U AC |= FDp, we can 
eliminate the ^diff choice goal from the chosen rules. In fact, if Tp' c denotes the 
immediate consequence operator for the chosen rules without the -idiffchoice 
goals, then Fp' c can replace Tp c in Equation [21 

Definition 5.2 

Given a choice program P, an inflationary choice fixpoint computation for P , is a 
sequence (I n )n>o of interpretations such that: 

i- h = 0, 

n. I n+1 E *p(I„), forn> 0. 

Inasmuch as every sequence (I n ) n >o is monotonic, it has a unique limit for n — > 
oo; this limit will be called an inflationary choice fixpoint for the choice program 
P. Thus, we have the following result: 

Theorem 5.1 

dGiannotti et al, 1991| ) Let P be a Datalog program with choice, and M a Herbrand 
interpretation for foe(P). Then M is a choice model for P iff M is an inflationary 
choice fixpoint for P. 

Moreover, the inflationary choice fixpoint is sound (every result is a choice model) 
and complete (for each choice model there is some inflationary choice fixpoint com- 
putation producing it). For logic programs with infinite Herbrand universe, an addi- 
tional assumption of fairness is needed to ensure completeness ( |Giannotti et a/.~ 999 ) . 
As customary for database queries, computational complexity is evaluated with re- 
spect to the size of the database. Then, we have the following result: 
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Theorem 5.2 

( |Giannotti et al., 1991| ) Let P be a choice Datalog program. Then, the data com- 
plexity of computing a choice model for P is polynomial time. 

Therefore, for a choice Datalog program, P, the computation of one of the stable 
models for foe(P) can be performed in polynomial time using the Choice Fixpoint 
Computation. This contrasts with the general intractability of finding stable models 
for general programs: in fact, we know that checking if a Datalog program with 
negation has a stable model is NP-complete ( Marek and Truszczynski, 1991[ > . 

Therefore, the choice construct allows us to capture a special subclass of programs 
that have a stable model semantics but are amenable to efficient implementation and 
are appealing to intuition. Implementing these programs only requires memorization 
of the chosen predicates; from these, the diffchoice predicates can be generated on- 
the-fly, thus eliminating the need to store diffchoice explicitly. Moreover, the model 
of memorizing tables to enforce functional dependencies provides a simple enough 
metaphor for a programmer to make effective usage of this construct without having 
to become cognizant on the subtleties of non-monotonic semantics. We conclude by 
mentioning that, although we are considering (positive) choice Datalog programs, 
our framework can be trivially extended to also consider stratified negation. The 
computation of a choice model for a stratified choice program can be carried out 
by partioning the program into an ordered number of suitable subprograms (called 
'strata') and computing the choice fixpoints of every stratum in their order. 

5.2 Greedy Choice 

Definition 15.11 leaves quite a bit of latitude in the computation of A (Equation 
121 . This freedom can be used to select As that have additional properties. In 
particular, we want to explore specializations of this concept that trade nondc- 
tcrministic completeness (which is only of abstract interest to a programmer) in 
return for very concrete benefits, such as expressive power and performance. For 
instance, in the specialization called Eager Choice ( |Giannotti et al., 1991| ), a max- 
imal AC is used in Equation [21 This results in a significant increase in expressive 
power, as demonstrated by the fact that negation can be emulated by eager choice 
dGiannotti et al, 199l| |Giannotti et al., 1999| ). 

In this paper, we focus on a specialization of choice called greedy choice; our 
interest in this constructs follows from the observation that it is frequently desirable 
to select a value that is the least (or the most) among the possible values and still 
satisfy the FDs defined by the choice atoms. 

A choice-least (resp. choice-most) atom is of the form choiceleast ( (X) , (C)) 
(resp. choicemost ( (X) , (C) ) ) where X is a list of variables and C is a single variable 
ranging over an ordered domain. A rule may have at most one choice-least or one 
choice-most atom. A goal choiceleast ( (X) , (C) ) (resp. choicemost ( (X) , (C) ) ) in 
a rule r can be used to denote that the FD defined by the atom choice ( (X) , (C) ) 
is to be satisfied — the declarative semantics of choice, choice-least and choice-most 
coincide. For instance, a rule of the form 
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p(X, Y, C) <- q(X, Y, C), choice((X), (Y)), choiceleast((X), (C)). 

defines the FD X — + Y, C on the possible instances of p. Thus, assuming that q is 
defined by the facts q(a, 6, 1) and q(a, c, 2), from the above rule we can derive either 
p(a,b, 1) or p(a, c, 2). Moreover, the choice-least goal introduces some heuristic in 
the computation to derive only p(a, b, 1). This means that, by using choice-least 
and choice-most predicates, we introduce some preference criteria on the stable 
models of the program. The 'greedy' fixpoint computation permit us to compute a 
'preferred' stable model. 

We can now define a choice-least rule (resp. choice-most rule) as one that contains 
one choice-least (resp. one choice-most) goal, and zero or more choice goals. More- 
over, we also assume that our programs contain either choice-least or choice- most 
rules. A program that contains choice-least rules (choice-most rules) and possibly 
other rules with zero or more choice goals is called a choice-least program (a choice- 
most program) . Choice-least and choice-most programs have dual properties; thus 
in the rest of the paper we will often mention the properties of one kind of program 
with the understanding that the corresponding properties of the other are implicitly 
defined by this duality. 

The correct computation of choice-least programs can be thus defined by special- 
izing the nondeterministic immediate consequence operator by (i) ensuring that A 
is a singleton set, containing only one element (ii) ensuring that a least-cost tuple 
among those that are candidates is chosen. 

Formally, we can use as our starting point the lazy version of choice where A is 
specialized into a singleton set 8. The specialized version of f p so derived will be 
denoted ^ p Zy ; as proven in ( Gia nnotti et at, 1991| ), the inflationary choice fixpoint 
restricted using ^ l p Zy operators still provides a sound and nondeterministically 
complete computation for the choice models of P. 

We begin by decomposing ^p Zy in three steps: 

Definition 5.3 

Lazy Immediate- Consequence Operator (LICO). 

Let P be a choice program and I an interpretation of P. Then *ffp(I) for P is 
defined as follows: 



Given an interpretation /, a set A 6 IV (J) and two tuples ti,t2 G A. We say 
that t\ < t-2 if both tuples are inferred only by choice-least rules and the cost of t\ 
is lesser than the cost of ti. Further, we denote with least(A) the set of tuples of 
A with least cost, i.e. least(A) = {t\t e A and Jki G A s.t. u < t}. 

Therefore, the implementation of greedy algorithms follows directly from replac- 
ing 5 £ ®i with 6 € least(Qj). 



^ zy {I) 



9/ 



{SeT Pc (I)\I I IU{S}\=FD P } 
{ID {6} | SeOj} U {/ | 6/ = 0} 

{t£(j)| Jer^(j)} 
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Definition 5.4 

Least-Cost Immediate- Consequence Operator. 

Let P be a choice program and I an interpretation of P. Then ty l p ast (I) for P is 
defined as follows: 



^,ieast kg ca ]2 e( j ^ e Least- Cost Immediate-Consequence Operator. 

Likewise, we have the dual definition of the Most-Cost Immediate-Consequence 
Operator. 

Definition 5.5 

Let P be a program with choice and choice-least goals. An inflationary least choice 
fixpoint computation (LFC) for P, is a sequence (I n } n >a of interpretations such 
that: 



Thus, all the tuples that do not violate the given FDs (including the FDs implied 
by least) are considered, and one is chosen that has the least value for the cost 
argument. 

Theorem 5.3 

Let P be a Datalog program with choice and choiceJeast. Then, 

1. every inflationary least choice fixpoint for P is a choice model for P. 

2. every inflationary least choice fixpoint of P can be computed in polynomial 
time. 



For the first property, observe that every computation of the inflationary least 
choice fixpoint is also a compuation of the lazy choice fixpoint. Therefore every 
inflationary least choice fixpoint for P is a choice model for P. 

The second property follows from the fact that the complexity of the inflationary 
lazy choice fixpoint is polynomial time. Moreover, the cost of selecting a tuple with 
least cost is also polynomial. Therefore, the complexity of inflationary least choice 
fixpoint is also polynomial. □ 

While the inflationary choice fixpoint computation is sound and complete with re- 
spect to the declarative stable-model semantics the inflationary least (most) choice 
fixpoint computation is sound but no longer complete; thus there are choice mod- 
els that are never produced by this computation. Indeed, rather than following a 
"don't care" policy when choosing among stable models, we make greedy selections 
between the available alternatives. For many problems of interest, this greedy pol- 
icy is sufficient to ensure that the resulting models have some important optimality 



9/ 

r^ ast (/) 



{SeT Pc (I)\I | IU{5} \=FD P } 
{/U{<5} | 5 e ;east(6/)} U {/ | 6/ = 0} 

{t£(J)| Jer<§-< (/)} 



i. Jo = 0, 

ii. I n+1 e¥£ ast (I n ), forn>0. 



Proof 
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properties, such as the minimality of the sum of cost of the edges. The model so 
constructed, will be called greedy choice models 2 . 

6 Greedy Algorithms 

In a system that adopts a concrete semantics based on least choice fixpoint, a pro- 
grammer will specify a choice-least ( (X) , (Y) ) goal to ensure that only particular 
choice models rather than arbitrary ones are produced, through the greedy selection 
of the least values of Y at each step. Thus an optimal matching in a directed graph 
problem can be expressed as follows: 

Example 6.1 

Optimal Matching in a bipartite graph 

opt_matching(X, Y) <— g(X,Y,C), choice ((Y), (X)), 

choice((X), (Y)), choiceleast((X), (C)). 

Observe that this program is basically that of Example 14 . 21 after that the choice 
goal with a cost argument has been specialized to a choice-least goal. 

The specialization of choice goals into choice-least or choice-most goals yields a 
convenient and efficient formulation of many greedy algorithms, such as Dijkstra's 
shortest path and Prim's minimum-spanning tree algorithms discussed next. 

The algorithm for finding the minimum spanning tree in a weighted graph, start- 
ing from a source node a, can be derived from the program of Example 14.31 by 
simply replacing the goal choice((Y), (C)) with choiceleast((Y), (C)) yielding the 
well-known Prim's algorithm. 

Example 6.2 
Prim's Algorithm. 

st(root, a, 0). 

st(X, Y, C) <- st(_, X, _), g(X, Y, C), Y ^ a, 

choice((Y), (X)), choiceleast((Y), (C)). 

Analogously, the algorithm for finding the shortest path in a weighted digraph, 
starting from a source node a, can be derived from the program of Example 14 . 41 by 
simply replacing the goal choice((Y), (C)) with choiceleast((Y), (C)), yielding the 
well-known Dijkstra's algorithm, below. 

Example 6.3 
Dijkstra 's algorithm. 

dj(a, 0). 

dj (Y, C) «- dj (X, d), g(X, Y, C 2 ), Y ^ a, 

C = Ci + C 2 , choiceleast((Y), (C)). 

2 In terms of relation between declarative and operational semantics, the situation is similar to 
that of pure Prolog programs, where the the declarative semantics is defined by the set of all 
legal SLD-trccs, but then one particular tree will be generated instead of others according to 
some preference criterion 
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Consider now the program of Example 14. II which chains the elements of a domain 
d(X) in an arbitrary order. Say now that a particular lexicographical order is pre- 
defined and we would like to sort the elements of d(X) accordingly. Then, we can 
write the rules as follows: 

Example 6.4 

Sequencing the elements of a relation in decreasing order. 
succ(root, root). 

succ(X,Y)^- succ(_,X), d(Y), 

choicemost((X), (Y)), choice((Y), (X)). 

Greedy algorithms often provide efficient approximate solutions to NP-complete 
problems; the following algorithm yields heuristically effective approximations of op- 
timal solutions for the traveling salesperson problem ( Papadi mitriou and Steiglitz, 1975| ). 

Example 6.5 
Greedy TSP. 

Given a complete undirected graph, the exit rule simply selects an arbitrary node 
X, from which to start the search. Then, the recursive rule greedily chooses at each 
step an arc (X, Y, C) of least cost C having X as its end node. 

spath(root,X, 0) <— node(X), choice((), X). 

spath(X, Y, C) «- spath(_, X, .), g(X, Y, C), 
spath(root, Z, 0), Y/Z, 
choice((X), (Y)), choice((Y), (X)), 
choiceleast((Y), (C)). 

Observe that the program of Example 16.51 was obtained from that of Example 
14.51 bv replacing a choice goal with its choice-least counterpart. 

Example 6.6 

While we have here concentrated on graph optimization problems, greedy algo- 
rithms are useful in a variety of other problems. For instance, in ( |Greco and Zaniolo, 1 998 ) 
it is presented a greedy solution to the well-known knapsack problem consists in 
finding a set of items whose total weight is lesser than a given value (say 100) and 
whose cost is maximum. This is an NP-completc problem and, therefore, the opti- 
mal solution requires an exponential time (assuming P ^ NP) but an approximate 
solution carried out by means of a greedy computation, which selects at each step 
the item with maximum value/weight ratio. 

In conclusion, we have obtained a framework for deriving and expressing greedy 
algorithms (such as Prim's algorithm) characterized by conceptual simplicity, logic- 
based semantics, and short and efficient programs; we can next turn to the efficient 
implementation problem for our programs. 

7 Implementation and Complexity 

A most interesting aspect of the programs discussed in this paper is that their stable 
models can be computed very efficiently. In the previous sections, we have seen 



1G 



S. Greco and C. Zaniolo 



that the exponential intractability of stable models is not an issue here: our greedy 
fixpoint computations are always polynomial-time in the size of the database. In 
this section, we show that the same asymptotic complexity obtainable by expressing 
the algorithms in procedural languages can be obtained by using comparable data 
structures and taking advantage of syntactic structure of the program. 

In general, the computation consists of two phases: (i) compilation and (ii) ex- 
ecution. All compilation algorithms discussed here execute with time complexity 
that is polynomial in the size of the programs. Moreover, we will assume, as it 
is customarily done ( |Zaniolo et al., 19 97), that the size of the database dominates 
that of the program. Thus, execution costs dominate the compilation costs, which 
can thus be disregarded in the derivation of the worst case complexities. 

We will use compilation techniques, such as the differential fixpoint computation, 
that are of common usage in deductive database systems flZaniolo et al, 1997| ). Also 
we will employ suitable storage structures, such as hash tables to support search 
on keys, and priority queues to support choice-least and choice-most goals. 

We assume that our programs consist of a set of mutually recursive predicates. 
General programs can be partitioned into a set of subprograms where rules in every 
subprogram defines a set of mutually recursive predicates. Then, subprograms are 
computed according to the topological order defined by the dependencies among 
predicates, where tuples derived from the computation of a subprogram are used as 
database facts in the computation of the subprograms that follow in the topological 
order. 

7. 1 Implementation of Programs with Choice 

Basically, the chosen atoms need to be memorized in a set of tables chosen r (one for 
each chosen r predicate). The diff choice atoms need not be computed and stored; 
rather, a goal -^diffchoice r (. . .) can simply be checked dynamically against the table 
chosen r . We now present how programs with choice can be evaluated by means of 
an example. 

Example 7.1 

Consider again Example 16.41 

si : succ(root, root). 

S2 : succ(X, Y) <— succ(_,X), d(Y), 

choicemost((X), (Y)), choice((Y), (X)). 

According to our definitions, these rules are implemented as follows: 
r\ : succ(root, root). 

r2 : succ(X, Y) <— succ(_,X), d(Y), chosen(X,Y). 

r 3 : chosen(X,Y) <- succ(_,X), g(X,Y,C), -.dif f choice(X, Y). 

r 4 : dif f choice(X, Y) «- chosen(X, Y'), Y' ^ Y. 

r 5 : diff choice(X, Y) <- chosen(X', Y), X' ^ X. 

(Strictly speaking, the chosen and diffchoice predicates should have been added 
the subscript s 2 for unique identification. But we dispensed with that, since there 



Greedy Algorithms in Datalog 



17 



is only one choice rule in the source program and no ambiguity can occur.) The 
diffchoice rules are used to enforce the functional dependencies X — > Y and Y — > X 
on the chosen tuples. These conditions can be enforced directly from the stored table 
chosen(X,Y) by enforcing the following constraints 3 : 

<- chosen(X, Y), chosen(X, Y'), Y' ^ Y. 
<- chosen(X, Y), chosen(X', Y), X' ^ X. 

that are equivalent to the two rules defining the predicate ^diffchoice. Thus, rules 
r4 and r§ are never executed directly nor is any diffchoice atom ever generated or 
stored. Thus we can simply eliminate the diffchoice rules in the computation of our 
program foe(P) = Pq U Pd- In addition, as previously observed, we can eliminate 
the goal ^diffchoice from the chosen rules without changing the definition of 
LICO (the Lazy Immediate- Consequence Operator introduced in Definition 15. 
Therefore, let P' D denote Pd after the elimination of the diffchoice rules, and let 
Pq denoted the rules in Pc after the elimination of their negated diffchoice goals; 
then, we can express our LICO computation as follows: 

©/ = {5€T P ^I)\I I IU{S}\=FD P } 

T l ™ v (I) = {IV {6} | <5g9/} U {/ | 6/ = 0} 
= { T P 1{J) | JeT P (I) } 

D 

Various simplifications can be made to this formula. For program of Example 
17.11 P' D consists of the exit rule r\ : which only needs to fired once, and of the rule 
f2, where the variables in choice goals are the same as those contained in the head. 
In this situation, the head predicate and the chosen predicate can be stored in the 
same table and Tp' D is implemented at no additional cost as part of the computation 
of chosen. 

Consider now the implementation of a table chosen r . The keys for this table 
are the left sides of the choice goals: X and Y for the example at hand. The data 
structures needed to support search and insertion on keys are well-known. For main 
memory, we can use hash tables, where searching for a key value, and inserting or 
deleting an entry can be considered constant-time operations. Chosen tuples are 
stored into a table which can be accessed by means of a set of hash indexes. More 
specifically, for each functional dependency X — ► Y there is an hash index on the 
attributed specified by the variables in X. 

7.2 Naive and Seminaive Implementations 

For Example l7.ll the application of the LICO to the empty set, yields O/ = 0; then, 
from the evaluation of the standard rules we get the set ^ p Zy {%) = {p(nil, a)}. At 
the next iteration, we compute Qi 1 and obtain all arcs leaving from node a. One of 
these arcs is chosen and the others are discarded, as it should be since they would 
otherwise violate the constraint X <-> Y. This naive implementation of W generates 



3 A constraint is a rule with empty head which is satisfied only if its body is false. 
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Algorithm, 7.1 

Semi-naive computation of a choice model. 
Input: Choice program P. 
Output: Choice model / for foe(P). 
begin 

Initialization. 

For every r € P' c set chosen r = theta r = 0; 
Set: 7:=r!r(0); 

D 

1 Repeat 

(i) Select an unmarked arbitrary r £ P'c and mark r 

(ii) Compute: 9 r — (T r (I) \ theta r ) \ conf lict(T r (l) \ theta r , chosen r ); 

(iii) Add 6 r to theta r 

Until theta r 7^ or all rules in P' c are marked; 

2 If theta r = Return 7; 

3 (i) Select an arbitrary x £ theta r , and move x from theta r to chosen r ; 

(ii) With S — {x}, delete from the selected table theta r every tuple in 
conf lict(theta r , 5). 

4 Set: I = Tp" (J U S), then unmark all P' c rules and resume from Step 1. 

D 

end. 

Fig. 1. Semi-naive computation 



no redundant computation for Example 17.11 similar considerations also hold for 
the simple path program of Example 14.51 In many situations however, tuples of 0/ 
computed in one iteration, also belong to 0/ in the next iteration, and memorization 
is less expensive than recomputation. Symbolic differentiation techniques similar to 
those used in the seminaive fixpoint computation, can be used to implement this 
improvement (Zan iolo et al., 1997| ), as described below. 

We consider the general case, where a program can have more than one mutually 
recursive choice rule and we need to use separate chosen r tables for each such rule. 
For each choice rule r, we also store a table theta r with the same attributes as 
chosen r . In theta r , we keep the tuples which are future candidates for the table 
chosen r . 

We update incrementally the content of the tables theta r as they were concrete 
views, using differential techniques. In fact, r = 9 r U theta r , where theta r is 
the table accumulation for the 'old' r tuples and 8 r is the set of 'new' r tuples 
generated using the differential fixpoint techniques. Finally, 0/ in the LICO is 
basically the union of the r for the various choice rules r. 

With P' c be the set of chosen rules in foe(P), with the ^diff choice goal re- 
moved; let T r denote the immeditate consequence operator for a rule r S P' c . Also, 
P' D will denote foe(P) after the removal of the chosen rules and of the diff choice 
rules: thus P' D is Pr> whithout the diff choice rules. 

The computation of ^ p Zy can then be expressed by means of the algorithm 
reported in Fig. 17.21 
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In fact, the basic computation performed by our algorithm is operational trans- 
lation of *i>p ZV , enhanced with the differential computation of <d r . At Step 0, the 
non-choice rules are computed strafing from the empty set. This corresponds to the 
computation of the non-recursive rules (exit rules) in all our examples, but Exam- 
ples and ^3] for which the exit rules are choice rules and are first computed at 
Step 1. 

In Step 1 and Step 3, of this algorithm, we used the function conf lict r (S, R) 
defined next. Let S and R be two union-compatible relations, whose attribute sets 
contain the left sides of the choice goals in r, i.e., the unique keys of chosen r (X and 
Y for the example at hand). Then, conf lict r (S, R) is the set of tuples in S whose 
chosen r -kcy values are also contained in R. 

Now, Step 1 brings up to date the content of the theta r table, while ensuring 
that this does not contain any tuple conflicting with tuples in chosen r . 

Symbolic differentiation techniques are used to improve the computation of T r (I)\ 
theta r in recursive rules at Step 1 (ii) ( |Zaniolo et al., 1 997). This technique is par- 
ticularly simple to apply to a recursive linear rule where the symbolic differentiation 
yields the same rule using, instead of the tuples of the whole predicate, the delta- 
tuples computed in the last step. All our examples but Examples 14 . 51 and lfi . 51 involve 
linear rule. The quadratic choice rule in Example l4.5l is differentiated into a pair of 
rules. In all examples, the delta-tuples are as follows: 

(i) The tuples produced by the exit rules at Step 

(ii) The new tuples produced at Step 4 of the last iteration. For all our examples, 
Step 4 is a trivial step where Tj^ (7 U 6) = I U 6; thus S is the new value produced 
at Step 4. 

Moreover, if r corresponds to a non-recursive, as the first rule in Examples 14.21 
and 14. 51 then this is only executed once with theta r = 0. 

At Step 2, we check the termination condition, 8/ = 0, i.e., 8 r = for all r in 
Pc 

Step 3 (ii) eliminates from theta r all tuples that conflict with the tuple 5 (in- 
cluding the tuple itself). 

Therefore, Algorithm 17.11 computes a stable model for foe(P) since it imple- 
ments a differential version of of the operator '5 p Zy , and applies this operator until 
saturation. 

We can now compute the complexity of the example programs of Section 4. For 
graphs, we denote by n and e, respectively, the number of their nodes and edges. 

Complexity of rooted spanning-tree algorithm: Example \4-&l 
The number of chosen tuples is bounded by 0(n), while the number of tuples com- 
puted by the evaluation of body rules is bounded by 0(e), since all arcs connected 
to the source node are visited exactly once. Because the cost of generating each 
such arc, and the cost of checking if this is in conflict with a chosen tuple are 0(1), 
the total cost is O(e). 
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Complexity of single-source reachability algorithm: Example \4-4\ 
This case is very similar to the previous one. The size of reach is bounded by 0(n), 
and so is the size of the chosen and theta relations. However, in the process of 
generating reach, all the edges reachable from the source node a are explored by 
the algorithm exactly once. Thus the worst case complexity is 0(e). 

Complexity of simple path: Example \4-5\ 
Again, all the edges in the graph will be visited in the worst case, yielding complexity 
O(e), where e = n 2 , according to our assumption that the graph is complete. Every 
arc is visited once and, therefore, the global complexity is 0(e), with e = n 2 . 

Complexity of a bipartite matching: Example \4- C A 
Initially all body tuples are inserted into the theta relation at cost 0(e). The 
computation terminates in 0(min(n\, 712)) = 0(n) steps, where n± and n% are, 
respectively, the number of nodes in the left and right parts of the graph. At each 
step, one tuple t is selected at cost O(l) and the tuples conflicting with the selected 
tuple are deleted. The global cost of deleting conflicting tuples is 0(e), since we 
assume that each tuple is accessed in constant time. Therefore the global cost is 
0(e). 

Complexity of linear sequencing of the elements of a set: Example \4-1\ 
If n is the cardinality of the domain d, the computation terminates in 0(n) steps. 
At each step, n tuples are computed, one tuple is chosen and the remaining tuples 
are discarded. Therefore the complexity is 0(n 2 ). 



7.3 Implementation of Choice-least/most Programs 

In the presence of choice-least (or choice-most) goals, the best alternative must be 
computed, rather than an arbitrary one chosen at random. Thus, we introduce a 
new algorithm, reported in Fig. 17.31 derived from the algorithm of Fig. 17.21 

Let us consider the general case where programs could contain three different 
kinds of choice rules: (i) choice-least rules that have one choice least goal, and zero 
or more choice goals, (ii) choice-most rules that have one choice-most goal and zero 
or more choice goals, and (iii) pure choice rules that have one or more choice goals 
and no choice- least or choice-most goals. Then Step 3 (i) in Algorithm 17. II should 
be modified as follows: 

3: (i) If r is a choice-least (choice-most) rule then select a single tuple x € theta r with 
least (most) cost; otherwise (r is pure choice rule, so) take an arbitrary x £ theta r . 
Move x from theta r to chosen r ; 

An additional optimization is however possible, as discussed next. Consider for 
instance Prim's algorithm in Example 16.21 

Say that theta contains two tuples t\ — (x, y 1; ci) and ti — (x, y 2 , c 2 ). Then, the 
following properties hold for Algorithms 1 with Step 3 (i) modified as shown above: 

• If ci < c 2 then ti is not a least-cost tuple, 
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• t\ belongs to conf lict(theta, &), if and only if t% does. 

Therefore, the presence of t% is immaterial to the result of the computation, and 
we can modify our algorithm to ensure that only t\ is kept in table theta. This 
improvement can be implemented by ensuring that the attribute Y is unique key 
for the table theta. When a new tuple t' is generated and a tuple with the same 
key value is found in theta, the tuple with the smaller cost value is entered in the 
table and the other is discarded. This reduces, the maximum cardinality of theta 
for Prim's and Dijstra's algorithm from e (number of arcs) to n, (number of nodes). 

However, the above considerations are not valid for rules containing more than 
one choice atoms. For instance, in the greedy TSP program, or the optimal matching 
program (Examples 16 . 51 and 16 . II respectively), the choice rules have the following 
choice goals: 

choice((X), (Y)), choice ( (Y), (X)), choiceleast((Y), (C)) 

Say that theta contains the following tuples: t\ = (xi,yi,Ci), t% = (xi,y 2 ,c 2 ) 
£3 = ( x 2,y2,C3), with c\ < C2 < C3. Although, C3 conflicts with C2 and has larger 
cost value, it cannot be eliminated, since it has a chance to be selected later. For 
instance, if t\ is selected first then ti will be eliminated, since it conflicts with t\. 
But £3 does not conflict with t\, and remains, to be selected next. 

Thus, the general rule is as follows: 

1. the union of the left sides of all choice goals is a unique key for theta, 

2. when a new tuple is inserted and there is a conflict on the unique key value, 
retain in theta only the tuple with the lesser cost. 

The above optimization can be carried out by modifying Step 1 (iii) in Algorithm 
17. II as follows: 

1: (iii) Add each tuple of 9 r to theta r ; when key conflicts occur, and r is a choice-least 
(choice-most) table, retain the lesser (larger) of the tuples. 

Moreover, insertion and deletion of an element from a theta table can be done 
in constant time, since we are assuming that hash indexes are available, whereas 
the selection of a least /most cost element is done in linear time. The selection of 
the least/most cost element can be done in constant time by organizing theta as 
priority queues. However the cost of insertion, deletion of least/most cost element 
from a priority queue are now logarithmic, rather than constant time. Therefore, 
when using priority queues we can improve the performance of our algorithm by 
delaying the merging r into theta r (Step 2), as to allow the elimination from 9 r 
of tuples conflicting with the new 5 selected at Step 3. Therefore, we will move one 
tuple from r to theta r (if 9 r ^ 0), in Step 1 (iii), and the remaining tuples at the 
end of Step 3 (those conflicting with S excluded) . 

Observe that the improvement performed in Step 3 reduces the max cardinality 
of tables theta r . In our Prim's algorithm the size of the table theta is reduced from 
the number of edges e to the number of nodes n. This has a direct bearing on the 
performance of our algorithm since the selection of a least cost tuple is performed 
n times. Now, if a linear search is used to find the least-cost element the global 
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Algorithm 7.2 

Greedy Semi-naive computation of a choice model. 

Input: Choice program P. 

Output: I, a greedy choice model for P. 

begin 

0: Initialization. 

For every r € P' c set chosen r = theta r = 0; 
Set: I~Tlr(<D); 

D 

1: Repeat 

(i) Select an unmarked arbitrary r £ P'c and mark r 

(ii) Compute: 9 r = {T r (I) \ theta r ) \ conf lict(T r (l) \ theta r , chosen r ); 

(iii) If r is a choice-least (choice-most) rule then select a single tuple x G 9 r with 
least (most) cost; otherwise (r is pure choice rule, so) take an arbitrary x 6 9 r . 
Move x from 6 r to theta r ; 

Until theta r ^ or all rules in P' c are marked; 
2: If theta r = Return J; 

3: (i) If r is a choice-least (choice-most) rule then select a single tuple x £ theta r 
with least (most) cost; otherwise (r is pure choice rule and) take an arbitrary 
x £ theta r . Move x from theta r to chosen r ; 

(ii) With S — {x}, delete from the selected table theta r every tuple in 
conf lict(theta r , 5). 

(iii) Add each tuple of 9 r to theta r ; when key conflicts occur, and r is a choice-least 
(choice-most) table, retain the lesser (larger) of the tuples. 

4: Set: / = T^f (I U S), using the differential fixpoint improvement, then unmark all 

D 

P'q rules and resume from Step 1. 

end. 

Fig. 2. Greedy Semi- naive computation 



complexity is 0{n x n). Similar considerations and complexity measures hold for 
Dijsktra's algorithm. 

In some cases however, the unique key improvement just describe might be of 
little or no benefit. For the TSP program and the optimal matching program, where 
the combination of both end-points is the key for theta, no benefit is to be gained 
since there is at most one edge between the two nodes. (In a database environment 
this might follow from the declaration of unique keys in the schema, and can thus 
be automatically detected by a compiler). 

Next, we compute the complexity of the various algorithms, assuming that the 
theta tables are supported by simple hash-based indexes, but there is no priority 
queue. The complexities obtained with priority queues are discussed in the next 
section. 

Complexity of Prim's Algorithm: Examvle Ui.tA 
The computation terminates in 0(n) steps. At each step, 0(n) tuples are inserted 
into the table theta, one least-cost tuple is moved to the table chosen and con- 
flicting tuples are deleted from theta. Insertion and deletion of a tuple is done in 
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constant time, whereas selection of the least cost tuple is done in linear time. Since 
the size of theta is bounded by 0(n), the global complexity is 0(n 2 ). 

Complexity of Dijkstra Algorithm: Examrtle \6JA 
The overall cost is 0(n 2 ) as for Prim's algorithm. 

Complexity of sorting the elements of a relation: Example \b'-4\ 
At each step, n candidates tuples are generated, one is chosen, and all tuples are 
eliminated from theta. Here, each new X from succ is matched with every Y, even 
when differential techniques are used. Therefore, the cost is 0(n 2 ). 

Greedy TSP: Examvle \6.5l 
The number of steps is equal to n. At each step, n tuples are computed by the 
evaluation of the body of the chosen rule and stored into the temporary relation 
9. Then, one tuple with least cost is selected (Step 1) and entered in theta all the 
remaining tuples are deleted from the relation (Step 3). The cost of inserting one 
tuple into the temporary relation is 0(1). Therefore, the global cost is 0(n 2 ). 

Optimal Matching in a directed graph: Examvle \6.1\ 
Initially all body tuples are inserted into the theta relation at cost 0(e). The 
computation terminates in 0(n) steps. At each step, one tuple t with least cost is 
selected at cost 0(e) and the tuples conflicting with the selected tuple are deleted. 
The global cost of deleting conflicting tuples is 0(e) (they are accessed in constant 
time). Therefore the global cost is 0(e x n). 

7.4 Priority Queues 

In many of the previous algorithms, the dominant cost is finding the least value in 
the table theta r , where r is a least-choice or most-choice rule. Priority queues can 
be used to reduce the overall cost. 

A priority queue is a partially ordered tables where the cost of the i th element is 
greater or equal than the cost of the (i div 2) th element | |Aho et al, 1974| ). There- 
fore, our table theta r can be implemented as a list where each node having position 
i in the list also contains (1) a pointer to the next element, (2) a pointer to the 
element with position 2 x i, and (3) a pointer to the element with position i div 2. 
The cost of finding the least value is constant-time in a priority queue, the cost of 
adding or deleting an element is log(m) where m is the number of the entries in 
the queue. 

Also, in the implementation of Step 2 (ii), a linear search can be avoided by adding 
one search index for each left side of a choice or choice-least goal. For instance, for 
Dijkstra's algorithm there should be a search index on X, for Prim's on Y. The 
operation of finding the least cost element in 9 r can be done during the generation 
of the tuples at no additional cost. Then we obtain the following complexities: 
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Complexity of Prim's Algorithm: Examvle \6. l 2\ 
The computation terminates in 0(n) steps and the size of the priority queue is 
bounded by 0(n). The number of candidate tuples is bounded by 0(e). Therefore, 
the global cost is bounded by 0(e x logn). 

Complexity of Dijkstra Algorithm: Examvle \6.Sl 
The overall cost is 0(e x logn) as for Prim's algorithm. 

Complexity of sorting the elements of a relation: Example \6.4\ 
The number of steps is equal to n. At each step, n tuples are computed, one is 
stored into theta and next moved to chosen while all remaining tuples are deleted 
from 9. The cost of each step is 0(n) since deletion of a tuple from 9 is constant 
time. Therefore, the global cost is 0(n 2 ). 

Greedy TSP: Examvle \6.fil 
Observe that theta r here contains at most one tuple. The addition of the first 
tuple into an empty priority queue, theta r , and the deletion of the last tuple from 
it are constant time operations. Thus the overall cost is the same as that without 
a priority queue: i.e. the global cost is 0(n 2 ). 

Optimal Matching in a bipartite graph: Examvle \6.1\ 
Initially, all body tuples are inserted into the theta relation at cost 0(e x log e). The 
computation terminates in 0(n) steps. At each step, one tuple t is selected and all 
remaining tuples conflicting with t arc deleted (the conflicting tuples here are those 
arcs having the same node as source or end node of the arc). The global number 
of extractions from the priority queue is O(e). Therefore, the global complexity is 
0(e x loge). 

Observe that, using a priority queues, an asymptotically optimum performance ( |Aho et al., 1974| ) 
has been achieved for all problems, but that of sorting the elements of a domain, 
Example 16.41 This problem is considered in the next section. 

7.5 Discussion 

A look at the structure of the program in Example 16 . 41 reveals that at the beginning 
of each step a new set of (x,y) pairs is generated for theta by the two goals 
succ(_, X), d(Y) which define a Cartesian product. Thus, the computation can be 
represented as follows: 

O = 7T2SUCC x d 

where succ and d are the relations containing their homonymous predicates. We 
can also represent 9 and theta as Cartesian products: 

9 = (7T2<5 x d) \ chosen = ixi& x (d\ ^chosen) 

Therefore, the key to obtaining an efficient implementation here consists in stor- 
ing only the second column of the theta relation, i.e.: 

7T2theta = d \ 7r 2 chosen 
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The operation of selecting a least-cost tuple from theta now reduces to that of 
selecting a least-cost tuple from 7r2 theta, which therefore should be implemented 
as a priority queue. 

Assuming these modications, we can now recompute the complexity of our Ex- 
ample 3, by observing that all the elements in d are added to 7r2theta once at the 
hrst iteration. Then each successive iteration eliminates one element from this set. 
Thus, the overall complexity is linear in the number of nodes. For Example 11, the 
complexity is 0{n x logn) if we assume that a priority queue is kept for ^theta 
Thus we obtaine the optimal complexities. 

No similar improvement is applicable to the other examples, where the rules 
do not compute the Cartesian product of two relations. Thus, this additional im- 
provement could also be incorporated into a smart compiler, since it is possible 
to detect from the rules whether theta is in fact the Cartesian product of its two 
subprojections. However this is not the only alternative since many existing de- 
ductive database systems provide the user with enough control to implement this, 
and other differential improvements previously discussed, by coding them into the 
program. For instance, the CVC++ users could use XY-stratified programs for 
this purpose ( |Zaniolo et al., 1993| ); similar programs can be used in other systems 
HVaghani et al, 1994| ). 

8 Conclusion 

This paper has introduced a logic-based approach for the design and implementation 
of greedy algorithms. In a nutshell, our design approach is as follows: (i) formulate 
the all-answer solution for the problem at hand (e.g., find all the costs of all paths 
from a source node to other nodes) , (ii) use choice-induced FD constraints to restrict 
the original logic program to the non-deterministic generation of a single answers 
(e.g., find a cost from the source node to each other node), and (iii) specialize 
the choice goals with preference annotations to force a greedy heuristics upon the 
generation of single answers in the choice-fixpoint algorithm (thus computing the 
least-cost paths). This approach yields conceptual simplicity and simple programs; 
in fact it has been observed that our programs are often similar to pseudo code 
expressing the same problem in a procedural language. But our approach offers 
additional advantages, including a formal logic-based semantics and a clear design 
method, implementable by a compiler, to achieve optimal implementations for our 
greedy programs. This method is based on 

• The use of chosen tables and theta tables, and of differential techniques to 
support the second kind of table as a concrete view. The actual structure of 
theta tables, their search keys and unique keys are determined by the choice 
and choice-least goals, and the join dependencies implied by the structure of 
the original rule. 

• The use of priority queues for expediting the finding of extrema values. 
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Once these general guidelines are followed (by a user or a compiler) we obtain 
an implementation that achieves the same asymptotic complexity as procedural 
languages. 

This paper provides a refined example of the power of Kowalski's seminal idea: 
algorithms = logic + control. Indeed, the logic-based approach here proposed covers 
all aspects of greedy algorithms, including (i) their initial derivation using rules 
with choice goals, (ii) their final formulation by choice-least /most goals, (iii) their 
declarative stable-model semantics, (iv) their operational (fixpoint) semantics, and 
finally (v) their optimal implementation by syntactically derived data structures 
and indexing methods. This vertically integrated, logic-based, analysis and design 
methodology represents a significant step forward with respect to previous logic- 
based approaches to greedy algorithms (including those we have proposed in the 
past flGreco et al, 1992| [Ganguly et al, 1995| )). 
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